智能论文笔记

Recent Advances and New Frontiers in Spiking Neural Networks

Duzhen Zhang , Tielin Zhang , Shuncheng Jia , Qingyu Wang , Bo Xu

分类：神经与进化计算 | 人工智能

2022-03-12

近年来，尖峰神经网络（SNN）由于其丰富的时空动力学，各种编码方法和事件驱动的特征而自然拟合神经形态硬件，因此在脑启发的智能上受到了广泛的关注。随着SNN的发展，受到脑科学成就启发和针对人工通用智能的新兴研究领域的脑力智能变得越来越热。本文回顾了最新进展，并讨论了来自五个主要研究主题的SNN的新领域，包括基本要素（即尖峰神经元模型，编码方法和拓扑结构），神经形态数据集，优化算法，软件，软件和硬件框架。我们希望我们的调查能够帮助研究人员更好地了解SNN，并激发新作品以推进这一领域。

translated by 谷歌翻译

Universal adversarial perturbation for remote sensing images

Qingyu Wang , Guorui Feng , Zhaoxia Yin , Bin Luo

分类：计算机视觉

2022-02-22

Recently, with the application of deep learning in the remote sensing image (RSI) field, the classification accuracy of the RSI has been dramatically improved compared with traditional technology. However, even the state-of-the-art object recognition convolutional neural networks are fooled by the universal adversarial perturbation (UAP). The research on UAP is mostly limited to ordinary images, and RSIs have not been studied. To explore the basic characteristics of UAPs of RSIs, this paper proposes a novel method combining an encoder-decoder network with an attention mechanism to generate the UAP of RSIs. Firstly, the former is used to generate the UAP, which can learn the distribution of perturbations better, and then the latter is used to find the sensitive regions concerned by the RSI classification model. Finally, the generated regions are used to fine-tune the perturbation making the model misclassified with fewer perturbations. The experimental results show that the UAP can make the classification model misclassify, and the attack success rate of our proposed method on the RSI data set is as high as 97.09%.

translated by 谷歌翻译

PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction

Qingyu Wang , Baojian Ma , Wei Liu , Mingzhao Lou , Mingchuan Zhou , Huanyu Jiang , Yibin Ying

分类：计算机视觉

2021-11-30

立体声匹配是计算机愿景中的一个重要任务，这些任务是几十年来引起了巨大的研究。虽然在差距准确度，密度和数据大小方面，公共立体声数据集难以满足模型的要求。在本文中，我们的目标是解决数据集和模型之间的问题，并提出了一个具有高精度差异地面真理的大规模立体声数据集，名为Plantstereo。我们使用了半自动方式来构造数据集：在相机校准和图像配准后，可以从深度图像获得高精度视差图像。总共有812个图像对覆盖着多种植物套装：菠菜，番茄，胡椒和南瓜。我们首先在四种不同立体声匹配方法中评估了我们的Plandstereo数据集。不同模型和植物的广泛实验表明，与整数精度的基础事实相比，Plantstereo提供的高精度差异图像可以显着提高深度学习模型的培训效果。本文提供了一种可行和可靠的方法来实现植物表面密集的重建。 PlantSereo数据集和相对代码可用于：https://www.github.com/wangqingyu985/plantstereo

translated by 谷歌翻译

Toward Human-Like Evaluation for Natural Language Generation with Error Analysis

Qingyu Lu , Liang Ding , Liping Xie , Kanjian Zhang , Derek F. Wong , Dacheng Tao

分类：自然语言处理

2022-12-20

The state-of-the-art language model-based automatic metrics, e.g. BARTScore, benefiting from large-scale contextualized pre-training, have been successfully used in a wide range of natural language generation (NLG) tasks, including machine translation, text summarization, and data-to-text. Recent studies show that considering both major errors (e.g. mistranslated tokens) and minor errors (e.g. imperfections in fluency) can produce high-quality human judgments. This inspires us to approach the final goal of the evaluation metrics (human-like evaluations) by automatic error analysis. To this end, we augment BARTScore by incorporating the human-like error analysis strategies, namely BARTScore++, where the final score consists of both the evaluations of major errors and minor errors. Experimental results show that BARTScore++ can consistently improve the performance of vanilla BARTScore and outperform existing top-scoring metrics in 20 out of 25 test settings. We hope our technique can also be extended to other pre-trained model-based metrics. We will release our code and scripts to facilitate the community.

translated by 谷歌翻译

AIONER: All-in-one scheme-based biomedical named entity recognition using deep learning

Ling Luo , Chih-Hsuan Wei , Po-Ting Lai , Robert Leaman , Qingyu Chen , Zhiyong Lu

分类：自然语言处理 | 人工智能

2022-11-30

Biomedical named entity recognition (BioNER) seeks to automatically recognize biomedical entities in natural language text, serving as a necessary foundation for downstream text mining tasks and applications such as information extraction and question answering. Manually labeling training data for the BioNER task is costly, however, due to the significant domain expertise required for accurate annotation. The resulting data scarcity causes current BioNER approaches to be prone to overfitting, to suffer from limited generalizability, and to address a single entity type at a time (e.g., gene or disease). We therefore propose a novel all-in-one (AIO) scheme that uses external data from existing annotated resources to improve generalization. We further present AIONER, a general-purpose BioNER tool based on cutting-edge deep learning and our AIO schema. We evaluate AIONER on 14 BioNER benchmark tasks and show that AIONER is effective, robust, and compares favorably to other state-of-the-art approaches such as multi-task learning. We further demonstrate the practical utility of AIONER in three independent tasks to recognize entity types not previously seen in training data, as well as the advantages of AIONER over existing methods for processing biomedical text at a large scale (e.g., the entire PubMed data).

translated by 谷歌翻译

Vega-MT: The JD Explore Academy Translation System for WMT22

Changtong Zan , Keqin Peng , Liang Ding , Baopu Qiu , Boan Liu , Shwai He , Qingyu Lu , Zheng Zhang , Chuang Liu , Weifeng Liu

分类：自然语言处理

2022-09-20

我们描述了JD Explore Academy对WMT 2022共享的一般翻译任务的提交。我们参加了所有高资源曲目和一条中型曲目，包括中文英语，德语英语，捷克语英语，俄语 - 英语和日语英语。我们通过扩大两个主要因素，即语言对和模型大小，即\ textbf {vega-mt}系统来推动以前的工作的极限 - 进行翻译的双向培训。至于语言对，我们将“双向”扩展到“多向”设置，涵盖所有参与语言，以利用跨语言的常识，并将其转移到下游双语任务中。至于型号尺寸，我们将变压器限制到拥有近47亿参数的极大模型，以完全增强我们VEGA-MT的模型容量。此外，我们采用数据增强策略，例如单语数据的循环翻译以及双语和单语数据的双向自我训练，以全面利用双语和单语言数据。为了使我们的Vega-MT适应通用域测试集，设计了概括调整。根据受约束系统的官方自动分数，根据图1所示的sacrebleu，我们在{zh-en（33.5），en-zh（49.7）（49.7），de-en（33.7）上获得了第一名-de（37.8），CS-EN（54.9），En-CS（41.4）和En-Ru（32.7）}，在{ru-en（45.1）和Ja-en（25.6）}和第三名上的第二名和第三名在{en-ja（41.5）}上； W.R.T彗星，我们在{zh-en（45.1），en-zh（61.7），de-en（58.0），en-de（63.2），cs-en（74.7），ru-en（ru-en（ru-en）上，我们获得了第一名64.9），en-ru（69.6）和en-ja（65.1）}，分别在{en-cs（95.3）和ja-en（40.6）}上的第二名。将发布模型，以通过GitHub和Omniforce平台来促进MT社区。

translated by 谷歌翻译

Comprehensive identification of Long Covid articles with human-in-the-loop machine learning

Robert Leaman , Rezarta Islamaj , Alexis Allot , Qingyu Chen , W. John Wilbur , Zhiyong Lu

分类：机器学习

2022-09-16

Covid-19幸存者中很大一部分经历了经常影响日常生活的持续多系统症状，这种疾病被称为SARS-COV-2感染的长期或急性后静脉曲张。但是，识别长期的卷文章是具有挑战性的，因为文章是指使用各种较少常见的术语或根本不使用命名的条件。我们开发了一个迭代的人类机器学习框架，旨在有效利用可用的数据并最有效地利用人类标签。具体而言，我们的方法将数据编程与主动学习结合到了强大的集合模型中。在保留集上评估我们的模型表明了其他方法的灵敏度的三倍。我们将模型应用于PubMed来创建长期的共同集合，并证明（1）最长的卷vid文章在命名该条件时并不是用任何名称（2）来指代长的covid，在生物医学文献中最常使用的名称是长的，并且（3）长互联物与各种身体系统中的疾病有关。长期COVID系列每周更新，可在Litcovid门户网站上进行在线搜索：https：//www.ncbi.nlm.nih.gov/research/coronavirus/docsum/docsum?filters=e_condition.longcondition.longcovid.longcovid

translated by 谷歌翻译

Context-Aware Query Rewriting for Improving Users' Search Experience on E-commerce Websites

Simiao Zuo , Qingyu Yin , Haoming Jiang , Shaohui Xi , Bing Yin , Chao Zhang , Tuo Zhao

分类：机器学习

2022-09-15

电子商务查询通常简短而模棱两可。因此，查询理解通常使用查询重写来消除用户输入查询。在使用电子商务搜索工具时，用户倾向于在购买之前输入多个搜索，我们称之为上下文。这些历史搜索包含有关用户真正购物意图的上下文见解。因此，对此类上下文信息进行建模对于更好的查询重写模型至关重要。但是，现有的查询重写模型忽略了用户的历史行为，而仅考虑即时搜索查询，这通常是一个简短的字符串，提供有关真实购物意图的有限信息。我们建议一个端到端的上下文感知查询重写模型来弥合此差距，从而考虑了搜索上下文。具体而言，我们的模型使用历史记录搜索查询及其包含的单词构建了会话图。然后，我们采用图形注意机制，该机制对交叉关系进行建模并计算会话的上下文信息。随后，模型通过使用聚合网络将上下文信息与即时搜索查询组合来计算会话表示。然后将会话表示形式解码以生成重写的查询。从经验上讲，我们证明了我们方法对各种指标下最先进的方法的优越性。在从线购物平台的内部数据上，通过介绍上下文信息，我们的模型在MRR（平均值等级）指标下取得了11.6％的改善，并在HIT@16度量指标（命中率指标）下提高了20.1％使用最佳基线方法（基于变压器的模型）。

translated by 谷歌翻译

DiP-GNN: Discriminative Pre-Training of Graph Neural Networks

Simiao Zuo , Haoming Jiang , Qingyu Yin , Xianfeng Tang , Bing Yin , Tuo Zhao

分类：机器学习

2022-09-15

已经提出了图形神经网络（GNN）预训练方法来增强GNN的能力。具体而言，首先在大规模的未标记图上预先训练GNN，然后在单独的小标记图上进行微调，以用于下游应用程序，例如节点分类。一种流行的预训练方法是掩盖一部分边缘，并接受了GNN的培训以恢复它们。但是，这种生成方法遭受了图不匹配。也就是说，输入到GNN偏离原始图的蒙版图。为了减轻此问题，我们提出了DIP-GNN（图神经网络的歧视性预训练）。具体来说，我们训练一个发电机以恢复蒙版边缘的身份，同时，我们训练一个判别器，以区分生成的边缘与原始图的边缘。在我们的框架中，鉴别器看到的图形更好地匹配原始图，因为生成器可以恢复蒙版边缘的一部分。大规模同质和异质图的广泛实验证明了该框架的有效性。

translated by 谷歌翻译

MIPI 2022 Challenge on RGBW Sensor Re-mosaic: Dataset and Report

Qingyu Yang , Guang Yang , Jun Jiang , Chongyi Li , Ruicheng Feng , Shangchen Zhou , Wenxiu Sun , Qingpeng Zhu , Chen Change Loy , Jinwei Gu

分类：计算机视觉

2022-09-15

随着移动平台上对计算摄影和成像的需求不断增长，在相机系统中开发和集成了高级图像传感器与新型算法的发展。但是，缺乏用于研究的高质量数据以及从行业和学术界进行深入交流的难得的机会限制了移动智能摄影和成像（MIPI）的发展。为了弥合差距，我们介绍了第一个MIPI挑战，包括五个曲目，这些曲目着重于新型图像传感器和成像算法。在本文中，引入了RGBW关节Remosaic和Denoise，这是五个曲目之一，在全面分辨率上进行了RGBW CFA插值的插值。为参与者提供了一个新的数据集，其中包括70（培训）和15个（验证）高质量RGBW和拜耳对的场景。此外，对于每个场景，在0dB，24dB和42dB上提供了不同噪声水平的RGBW。所有数据均在室外和室内条件下使用RGBW传感器捕获。最终结果是使用PSNR，SSIM，LPIPS和KLD在内的客观指标评估的。本文提供了此挑战中所有模型的详细描述。有关此挑战的更多详细信息以及数据集的链接，请访问https://github.com/mipi-challenge/mipi2022。

translated by 谷歌翻译